WIRELESS METHODS AND DEVICES EMPLOYING STEGANOGRAPHY
Technical Field

The present invention relates to wireless communication systems, such as cellular
systems and PCS systems, and more particularly relates to methods and systems for reducing
theft of wireless telephony services by use of steganographically encoded authentication data.

Background and Summary of The Invention 
(For expository convenience, this disclosure generally refers to cellular telephony 
systems. However, it should be recognized that the invention is not so limited, but can be used
with any wireless communications device, whether for voice or data; analog or digital.) 

In the cellular telephone industry, hundreds of millions of dollars of revenue are lost
each year through theft of services. While some services are lost due to physical theft of
cellular telephones, the more pernicious threat is posed by cellular telephone hackers.

Cellular telephone hackers employ various electronic devices to mimic the identification
signals produced by an authorized cellular telephone. (These signals are sometimes called
authorization signals, verification numbers, signature data, etc.) Often, the hacker learns of
these signals by eavesdropping on authorized cellular telephone subscribers and recording the
data exchanged with the cell site. By artful use of this data, the hacker can impersonate an
authorized subscriber and dupe the carrier into completing pirate calls.

In the prior art, identification signals are segregated from the voice signals. Most 
commonly, they are temporally separated, e.g. transmitted in a burst at the time of call 
origination. Voice data passes through the channel only after a verification operation has taken 
place on this identification data. (Identification data is also commonly included in data packets
sent during the transmission.) Another approach is to spectrally separate the identification, 
e.g. in a spectral subband outside that allocated to the voice data. 

Other fraud-deterrent schemes have also been employed. One class of techniques 
monitors characteristics of a cellular telephone's RF signal to identify the originating phone.
Another class of techniques uses handshaking protocols, wherein some of the data returned by 
the cellular telephone is based on an algorithm (e.g. hashing) applied to random data sent 
thereto. 

Combinations of the foregoing approaches are also sometimes employed. 
U.S. Patents 5,465,387, 5,454,027, 5,420,910, 5,448,760, 5,335,278, 5,345,595, 
5,144,649, 5,204,902, 5,153,919 and 5,388,212 detail various cellular telephone systems, and
fraud deterrence techniques used therein. The disclosures of these patents are incorporated by 
reference. 

As the sophistication of fraud deterrence systems increases, so does the sophistication
of cellular telephone hackers. Ultimately, hackers have the upper hand since they recognize
that all prior art systems are vulnerable to the same weakness: the identification is based on
some attribute of the cellular telephone transmission outside the voice data. Since this attribute
is segregated from the voice data, such systems will always be susceptible to pirates who
electronically "patch" their voice into a composite electronic signal having the attribute(s)
necessary to defeat the fraud deterrence system.

To overcome this failing, the preferred embodiments of the present invention
steganographically encode the voice signal with identification data, resulting in "in-band"
signaling (in-band both temporally and spectrally). This approach allows the carrier to monitor
the user's voice signal and decode the identification data therefrom.

In one form of the invention, some or all of the identification data used in the prior art
(e.g. data transmitted at call origination) is repeatedly steganographically encoded in the user's
voice signal as well. The carrier can thus periodically or aperiodically check the identification
data accompanying the voice data against that sent at call origination to ensure they match. If
they do not, the call is identified as being hacked and remedial steps, such as interrupting
the call, can be taken.

In another form of the invention, a randomly selected one of several possible messages
is repeatedly steganographically encoded on the subscriber's voice. An index sent to the
cellular carrier at call set-up identifies which message to expect. If the message
steganographically decoded by the cellular carrier from the subscriber's voice does not match
that expected, the call is identified as fraudulent.
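The two carrier-side checks just described might be sketched as follows. This is only an illustration under stated assumptions: the message table, the function names, and the representation of identification data as strings are all hypothetical and are not taken from this disclosure.

```python
# Hypothetical carrier-side checks; every name here is illustrative only.
MESSAGES = ["MSG-A", "MSG-B", "MSG-C", "MSG-D"]  # assumed table shared by phone and carrier


def first_mismatch(origination_id, decoded_ids):
    """Return the position of the first decoded ID that differs from the ID
    sent at call origination, or None if every periodic check matches."""
    for position, decoded in enumerate(decoded_ids):
        if decoded != origination_id:
            return position
    return None


def message_is_expected(index, decoded_message):
    """Compare the decoded message with the one selected by the set-up index."""
    return MESSAGES[index] == decoded_message


if __name__ == "__main__":
    # The third periodic check fails, so the call would be flagged at position 2.
    print(first_mismatch("5551234", ["5551234", "5551234", "0000000"]))
    print(message_is_expected(1, "MSG-B"))
```

A non-None result from first_mismatch, or a False result from message_is_expected, corresponds to the point at which the call would be treated as fraudulent.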

In the preferred form of the invention, the steganographic encoding relies on a pseudo
random data signal to transform the message or identification data into a low level noise-like
signal superimposed on the subscriber's digitized voice signal. This pseudo random data signal
is known, or knowable, to both the subscriber's telephone (for encoding) and to the cellular
carrier (for decoding). Many such embodiments rely on a deterministic pseudo random
number generator seeded with a datum known to both the telephone and the carrier. In simple
embodiments this seed can remain constant from one call to the next (e.g. a telephone ID
number). In more complex embodiments, a pseudo-one-time pad system may be used, wherein
a new seed is used for each session (i.e. telephone call). In a hybrid system, the telephone and
cellular carrier each have a reference noise key (e.g. 10,000 bits) from which the telephone
selects a field of bits, such as 50 bits beginning at a randomly selected offset, and each uses
this excerpt as the seed to generate the pseudo random data for encoding. Data sent from the
telephone to the carrier (e.g. the offset) during call set-up allows the carrier to reconstruct the
same pseudo random data for use in decoding. Yet further improvements can be derived by
borrowing basic techniques from the art of cryptographic communications and applying them
to the steganographically encoded signal detailed in this disclosure.
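The hybrid seeding scheme can be illustrated with a short sketch. The function names, the use of Python's general-purpose random module as the deterministic generator, and the wrap-around at the end of the key are all assumptions made for illustration; a real system would presumably use a generator chosen for its security properties.

```python
import random


def make_reference_key(n_bits=10_000, key_seed=1234):
    """Stand-in for the reference noise key held by both telephone and carrier."""
    rng = random.Random(key_seed)
    return [rng.getrandbits(1) for _ in range(n_bits)]


def seed_from_key(reference_key, offset, field_len=50):
    """Pack field_len key bits starting at offset into an integer seed.
    Wrapping past the end of the key is an assumption of this sketch."""
    seed = 0
    for i in range(field_len):
        seed = (seed << 1) | reference_key[(offset + i) % len(reference_key)]
    return seed


def prn_stream(seed, length):
    """Deterministic stream of -1/+1 values derived from the seed."""
    rng = random.Random(seed)
    return [rng.choice((-1, 1)) for _ in range(length)]


if __name__ == "__main__":
    key = make_reference_key()
    offset = random.randrange(len(key))                    # chosen by the telephone
    phone_prn = prn_stream(seed_from_key(key, offset), 16)
    carrier_prn = prn_stream(seed_from_key(key, offset), 16)  # rebuilt from the offset
    print(phone_prn == carrier_prn)
```

Only the offset crosses the air interface in this sketch; the key itself and the resulting pseudo random data never do.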

Details of applicant's preferred techniques for steganographic encoding/decoding with a 
pseudo random data stream are more particularly detailed in applicant's prior applications, but 
the present invention is not limited to use with such techniques. A brief review of other 
steganographic techniques suitable for use with the present invention follows. 
British patent publication 2,196,167 to Thorn EMI discloses a system in which an audio
recording is electronically mixed with a marking signal indicative of the owner of the
recording, where the combination is perceptually identical to the original. U.S. patents
4,963,998 and 5,079,648 disclose variants of this system.


U.S. Patent 5,319,735 to B.B.N. rests on the same principles as the earlier Thorn EMI
publication, but additionally addresses psycho-acoustic masking issues.

U.S. Patents 4,425,642, 4,425,661, 5,404,377 and 5,473,631 to Moses disclose
various systems for imperceptibly embedding data into audio signals, the latter two patents
particularly focusing on neural network implementations and perceptual coding details.

U.S. Patent 4,943,973 to AT&T discloses a system employing spread spectrum
techniques for adding a low level noise signal to other data to convey auxiliary data therewith.
The patent is particularly illustrated in the context of transmitting network control signals
along with digitized voice signals.

U.S. Patent 5,161,210 to U.S. Philips discloses a system in which additional low-level 
quantization levels are defined on an audio signal to convey, e.g., a copy inhibit code, 
therewith.

U.S. Patent 4,972,471 to Gross discloses a system intended to assist in the automated
monitoring of audio (e.g. radio) signals for copyrighted materials by reference to identification
signals subliminally embedded therein.

There are a variety of shareware programs available on the internet (e.g. "Stego" and
"White Noise Storm") which generally operate by swapping bits from a to-be-concealed
message stream into the least significant bits of an image or audio signal. White Noise Storm
effects a randomization of the data to enhance its concealment.

A British company, Highwater FBI, Ltd., has introduced a software product which is 
said to imperceptibly embed identifying information into photographs and other graphical 
images. This technology is the subject of European patent applications 9400971.9 (filed 
January 19, 1994), 9504221.2 (filed March 2, 1995), and 9513790.7 (filed July 3, 1995), the 
first of which has been laid open as PCT publication WO 95/20291. 

Walter Bender at M.I.T. has done a variety of work in the field, as illustrated by his
paper "Techniques for Data Hiding," Massachusetts Institute of Technology, Media 
Laboratory, January 1995. 

Dice, Inc. of Palo Alto has developed an audio marking technology marketed under the 
name Argent. While a U.S. Patent Application is understood to be pending, it has not yet been 
issued. 

Tirkel et al, at Monash University, have published a variety of papers on "electronic 
watermarking" including, e.g., "Electronic Water Mark," DICTA-93, Macquarie University, 
Sydney, Australia, December, 1993, pp. 666-673, and "A Digital Watermark," IEEE 
International Conference on Image Processing, November 13-16, 1994, pp. 86-90. 

Cox et al, of the NEC Technical Research Institute, discuss various data embedding
techniques in their published NEC technical report entitled "Secure Spread Spectrum 
Watermarking for Multimedia," December, 1995. 

Moller et al. discuss an experimental system for imperceptibly embedding auxiliary 
data on an ISDN circuit in "Rechnergestutzte Steganographie: Wie sie Funktioniert und warum 
folglich jede Reglementierung von Verschlusselung unsinnig ist," DuD, Datenschutz und 
Datensicherung, 18/6 (1994) 318-326. The system randomly picks ISDN signal samples to 
modify, and suspends the auxiliary data transmission for signal samples which fall below a 
threshold. 


WYCdks 60085 01/05/00 


EXPRESS MAIL EL525676538US 


In addition to the foregoing, many of the other cited prior art patents and publications 
disclose systems for embedding a data signal on an audio signal. These, too, can generally be 
employed in systems according to the present invention. 

The foregoing and additional features and advantages of the present invention will be 
more readily apparent from the following detailed description, which proceeds with reference 
to the accompanying drawings. 

Brief Description Of The Drawings 
Fig. 1 is a block diagram showing principal components of an exemplary wireless 
telephony system. 

Fig. 2 is a block diagram of an exemplary steganographic encoder that can be used in 
the telephone of the Fig. 1 system. 

Fig. 3 is a block diagram of an exemplary steganographic decoder that can be used in 
the cell site of the Fig. 1 system. 

Figs. 4A and 4B are histograms illustrating signal relationships which may be exploited
to facilitate decoding. 

Detailed Description 

The reader is presumed to be familiar with cellular communications technologies. 
Accordingly, details known from prior art in this field aren't belabored herein. 

Referring to Fig. 1, an illustrative cellular system includes a telephone 10, a cell site 
12, and a central office 14. 

Conceptually, the telephone may be viewed as including a microphone 16, an A/D 
converter 18, a data formatter 20, a modulator 22, an RF section 24, an antenna 26, a 
demodulator 28, a data unformatter 30, a D/A converter 32, and a speaker 34. 

In operation, a subscriber's voice is picked up by the microphone 16 and converted to
digital form by the A/D converter 18. The data formatter 20 puts the digitized voice into
packet form, adding synchronization and control bits thereto. The modulator 22 converts this
digital data stream into an analog signal whose phase and/or amplitude properties change in
accordance with the data being modulated. The RF section 24 commonly translates this
time-varying signal to one or more intermediate frequencies, and finally to a UHF transmission
frequency. The RF section thereafter amplifies it and provides the resulting signal to the
antenna 26 for broadcast to the cell site 12.

The process works in reverse when receiving. A broadcast from the cell site is
received through the antenna 26. RF section 24 amplifies and translates the received signal to
a different frequency for demodulation. Demodulator 28 processes the amplitude and/or
phase variations of the signal provided by the RF section to produce a digital data stream
corresponding thereto. The data unformatter 30 segregates the voice data from the associated
synchronization/control data, and passes the voice data to the D/A converter for conversion
into analog form. The output from the D/A converter drives the speaker 34, through which
the subscriber hears the other party's voice.

The cell site 12 receives broadcasts from a plurality of telephones 10, and relays the
data received to the central office 14. Likewise, the cell site 12 receives outgoing data from
the central office and broadcasts same to the telephones.

The central office 14 performs a variety of operations, including call authentication,
switching, and cell hand-off.

(In some systems, the functional division between the cell site and the central station is
different than that outlined above. Indeed, in some systems, all of this functionality is
provided at a single site.)

In an exemplary embodiment of the present invention, each telephone 10 additionally 
includes a steganographic encoder 36. Likewise, each cell site 12 includes a steganographic 
decoder 38. The encoder operates to hide an auxiliary data signal among the signals 
representing the subscriber's voice. The decoder performs the reciprocal function, discerning
the auxiliary data signal from the encoded voice signal. The auxiliary signal serves to verify 
the legitimacy of the call. 

An exemplary steganographic encoder 36 is shown in Fig. 2.

The illustrated encoder 36 operates on digitized voice data, auxiliary data, and
pseudo-random noise (PRN) data. The digitized voice data is applied at a port 40 and is provided,
e.g., from A/D converter 18. The digitized voice may comprise 8-bit samples. The auxiliary
data is applied at a port 42 and comprises, in one form of the invention, a stream of binary data
uniquely identifying the telephone 10. (The auxiliary data may additionally include
administrative data of the sort conventionally exchanged with a cell site at call set-up.) The
pseudo-random noise data is applied at a port 44 and can be, e.g., a signal that randomly
alternates between "-1" and "1" values. (More and more cellular phones are incorporating
spread spectrum capable circuitry, and this pseudo-random noise signal and other aspects of
this invention can often "piggy-back" or share the circuitry which is already being applied in
the basic operation of a cellular unit.)

For expository convenience, it is assumed that all three data signals applied to the 
encoder 36 are clocked at a common rate, although this is not necessary in practice. 

In operation, the auxiliary data and PRN data streams are applied to the two inputs of a 
logic circuit 46. The output of circuit 46 switches between -1 and +1 in accordance with the
following table: 


AUX    PRN    OUTPUT
 0     -1       1
 0      1      -1
 1     -1      -1
 1      1       1


(If the auxiliary data signal is conceptualized as switching between -1 and 1, instead of 0 and 1, 
it will be seen that circuit 46 operates as a one-bit multiplier.) 
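The one-bit multiplier interpretation can be checked directly against the table. The function name below is illustrative only; the mapping of the auxiliary bit from {0, 1} to {-1, +1} is the one described in the parenthetical above.

```python
def logic_circuit_46(aux_bit, prn):
    """aux_bit is 0 or 1, prn is -1 or +1; the result is -1 or +1."""
    bipolar_aux = 2 * aux_bit - 1   # map 0 -> -1 and 1 -> +1
    return bipolar_aux * prn


# The four rows of the table: (AUX, PRN) -> OUTPUT.
TABLE = {(0, -1): 1, (0, 1): -1, (1, -1): -1, (1, 1): 1}

if __name__ == "__main__":
    print(all(logic_circuit_46(a, p) == out for (a, p), out in TABLE.items()))
```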

The output from gate 46 is thus a bipolar data stream whose instantaneous value
changes randomly in accordance with the corresponding values of the auxiliary data and the
PRN data. It may be regarded as noise. However, it has the auxiliary data encoded therein.
The auxiliary data can be extracted if the corresponding PRN data is known.

The noise-like signal from gate 46 is applied to the input of a scaler circuit 48. Scaler 
circuit scales (e.g. multiplies) this input signal by a factor set by a gain control circuit 50. In 
the illustrated embodiment, this factor can range between 0 and 15. The output from scaler
circuit 48 can thus be represented as a five-bit data word (four bits, plus a sign bit) which 
changes each clock cycle, in accordance with the auxiliary and PRN data, and the scale factor. 
The output from the scaler circuit may be regarded as "scaled noise data" (but again it is 
"noise" from which the auxiliary data can be recovered, given the PRN data). 
The scaled noise data is summed with the digitized voice data by a summer 51 to
provide the encoded output signal (e.g. binarily added on a sample by sample basis). This
output signal is a composite signal representing both the digitized voice data and the auxiliary
data.
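The signal path through gate 46, scaler 48 and summer 51 might be sketched as below. For simplicity the sketch assumes a constant scale factor and cycles through the auxiliary bits by index; the function name and these simplifications are assumptions, not the disclosed circuit.

```python
def encode_samples(voice, aux_bits, prn, gain):
    """Add scaled noise to each voice sample.

    voice    : list of integer voice samples
    aux_bits : list of 0/1 auxiliary bits, reused cyclically (an assumption here)
    prn      : list of -1/+1 values, one per voice sample
    gain     : constant scale factor (0..15 in the illustrated embodiment)
    """
    composite = []
    for i, sample in enumerate(voice):
        bit = aux_bits[i % len(aux_bits)]
        noise = (2 * bit - 1) * prn[i]          # output of logic circuit 46
        composite.append(sample + gain * noise)  # scaler 48 followed by summer 51
    return composite


if __name__ == "__main__":
    print(encode_samples([100, 100, 100, 100], [1, 0], [1, -1, -1, 1], gain=3))
```

No range limiting is applied here, so the output may exceed the range of the input samples.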

The gain control circuit 50 controls the magnitude of the added scaled noise data so its
addition to the digitized voice data does not noticeably degrade the voice data when converted
to analog form and heard by a subscriber. The gain control circuit can operate in a variety of
ways.

One is a logarithmic scaling function. Thus, for example, voice data samples having
decimal values of 0, 1 or 2 may correspond to scale factors of unity, or even zero, whereas
voice data samples having values in excess of 200 may correspond to scale factors of 15.
Generally speaking, the scale factors and the voice data values correspond by a square root
relation. That is, a four-fold increase in a value of the voice data corresponds to
approximately a two-fold increase in a value of the scaling factor associated therewith.
Another scaling function would be linear as derived from the average power of the voice
signal.
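A square-root gain rule consistent with the figures quoted above might look like the following. The constants (zero gain for samples of 0 to 2, a gain of 15 reached at a sample value of 200) are choices made for this sketch, and the function name is hypothetical.

```python
import math


def scale_factor(sample, max_gain=15, full_scale=200):
    """Map a voice sample value (0..255) to a scale factor (0..max_gain).

    Samples of 0, 1 or 2 get zero gain (the text also permits unity).
    Otherwise the gain grows with the square root of the sample value and
    saturates at max_gain once the sample reaches full_scale.
    """
    if sample <= 2:
        return 0
    return min(max_gain, int(max_gain * math.sqrt(sample / full_scale)))


if __name__ == "__main__":
    # A four-fold increase in sample value (25 -> 100) doubles the gain (5 -> 10).
    print(scale_factor(25), scale_factor(100), scale_factor(200))
```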

(The parenthetical reference to zero as a scaling factor alludes to cases, e.g., in which
the digitized voice signal sample is essentially devoid of information content.)

More satisfactory than basing the instantaneous scaling factor on a single voice data
sample, is to base the scaling factor on the dynamics of several samples. That is, a stream of
digitized voice data which is changing rapidly can camouflage relatively more auxiliary data
than a stream of digitized voice data which is changing slowly. Accordingly, the gain control
circuit 50 can be made responsive to the first, or preferably the second- or higher-order
derivative of the voice data in setting the scaling factor.
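One way to make the gain responsive to signal dynamics is to use a discrete second difference as a stand-in for the second derivative. The sensitivity constant and function names below are assumptions of this sketch; the disclosure does not specify a particular formula.

```python
def second_difference(voice, i):
    """Discrete second derivative at sample i (zero at the ends of the buffer)."""
    if i < 1 or i + 1 >= len(voice):
        return 0
    return voice[i + 1] - 2 * voice[i] + voice[i - 1]


def dynamic_scale_factor(voice, i, max_gain=15, sensitivity=0.5):
    """Larger gain where the voice data is changing rapidly, smaller where it is flat."""
    return min(max_gain, int(sensitivity * abs(second_difference(voice, i))))


if __name__ == "__main__":
    flat = [100, 100, 100, 100, 100]
    busy = [100, 140, 90, 150, 80]
    print(dynamic_scale_factor(flat, 2), dynamic_scale_factor(busy, 2))
```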

In still other embodiments, the gain control block 50 and scaler 48 can be omitted 
entirely. 

(Those skilled in the art will recognize the potential for "rail errors" in the foregoing
systems. For example, if the digitized voice data consists of 8-bit samples, and the samples
span the entire range from 0 to 255 (decimal), then the addition or subtraction of scaled noise
to/from the input signal may produce output signals that cannot be represented by 8 bits (e.g.
-2, or 257). A number of well-understood techniques exist to rectify this situation, some of
them proactive and some of them reactive. Among these known techniques are: specifying that
the digitized voice data shall not have samples in the range of 0-4 or 241-255, thereby safely
permitting combination with the scaled noise signal; and including provision for detecting and
adaptively modifying digitized voice samples that would otherwise cause rail errors.)
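The proactive and reactive approaches can each be illustrated in a line or two. The reactive version below uses simple saturation of the sum, which is only one possible way of handling an out-of-range result and is not stated in the disclosure; the margin value and function names are likewise assumptions.

```python
def restrict_range(voice_sample, margin=15, lo=0, hi=255):
    """Proactive: keep voice samples at least `margin` away from either rail."""
    return max(lo + margin, min(hi - margin, voice_sample))


def add_with_saturation(voice_sample, scaled_noise, lo=0, hi=255):
    """Reactive: clamp the composite sample when the sum would leave the 8-bit range."""
    return max(lo, min(hi, voice_sample + scaled_noise))


if __name__ == "__main__":
    print(restrict_range(250), restrict_range(3))
    print(add_with_saturation(250, 15), add_with_saturation(1, -3))
```

Saturation distorts the embedded noise at the affected samples, so a decoder would see slightly weaker evidence there.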

Returning to the telephone 10, an encoder 36 like that detailed above is desirably
interposed between the A/D converter 18 and the data formatter 20, thereby serving to
steganographically encode all voice transmissions with the auxiliary data. Moreover, the
circuitry or software controlling operation of the telephone is arranged so that the auxiliary
data is encoded repeatedly. That is, when all bits of the auxiliary data have been encoded, a
pointer loops back and causes the auxiliary data to be applied to the encoder 36 anew. (The
auxiliary data may be stored at a known address in RAM memory for ease of reference.)
It will be recognized that the auxiliary data in the illustrated embodiment is transmitted
at a rate one-eighth that of the voice data. That is, for every 8-bit sample of voice data, scaled
noise data corresponding to a single bit of the auxiliary data is sent. Thus, if voice samples are
sent at a rate of 4800 samples/second, auxiliary data can be sent at a rate of 4800 bits/second.
If the auxiliary data is comprised of 8-bit symbols, auxiliary data can be conveyed at a rate of
600 symbols/second. If the auxiliary data consists of a string of even 60 symbols, each second
of voice conveys the auxiliary data ten times. (Significantly higher auxiliary data rates can be
achieved by resorting to more efficient coding techniques, such as limited-symbol codes (e.g.
5- or 6-bit codes), Huffman coding, etc.) This highly redundant transmission of the auxiliary
data permits lower amplitude scaled noise data to be used while still providing sufficient
signal-to-noise headroom to assure reliable decoding - even in the relatively noisy
environment associated with radio transmissions.
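The rate arithmetic above can be written out explicitly. The variable names are illustrative; the numbers are those given in the text.

```python
SAMPLES_PER_SECOND = 4800     # voice sample rate used in the example
BITS_PER_VOICE_SAMPLE = 8
AUX_BITS_PER_SAMPLE = 1       # one auxiliary bit rides on each voice sample
BITS_PER_SYMBOL = 8
SYMBOLS_PER_MESSAGE = 60

voice_bits_per_second = SAMPLES_PER_SECOND * BITS_PER_VOICE_SAMPLE   # 38400
aux_bits_per_second = SAMPLES_PER_SECOND * AUX_BITS_PER_SAMPLE       # 4800
symbols_per_second = aux_bits_per_second // BITS_PER_SYMBOL          # 600
repeats_per_second = symbols_per_second // SYMBOLS_PER_MESSAGE       # 10
message_length_bits = SYMBOLS_PER_MESSAGE * BITS_PER_SYMBOL          # 480

if __name__ == "__main__":
    print(aux_bits_per_second, symbols_per_second, repeats_per_second, message_length_bits)
```

The 480-bit message length is the repetition interval relied upon in the decoding discussion.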

Turning now to Fig. 3, each cell site 12 has a steganographic decoder 38 by which it
can analyze the composite data signal broadcast by the telephone 10 to discern and separate the
auxiliary data and digitized voice data therefrom. (The decoder desirably works on
unformatted data, i.e. data with the packet overhead, control and administrative bits removed;
this is not shown for clarity of illustration.)

The decoding of an unknown embedded signal (i.e. the encoded auxiliary signal) from 
an unknown voice signal is best done by some form of statistical analysis of the composite data 
signal. 

In one approach, decoding relies on recombining the composite data signal with PRN 
data (identical to that used during encoding), and analyzing the entropy of the resulting signal. 
"Entropy" need not be understood in its most strict mathematical definition, it being merely 
the most concise word to describe randomness (noise, smoothness, snowiness, etc.). 

Most serial data signals are not random. That is, one sample usually correlates - to 
some degree - with adjacent samples. This is true in sampled voice signals. 

Noise, in contrast, typically is random. If a random signal (e.g. noise) is added to (or
subtracted from) a non-random signal (e.g. voice), the entropy of the resulting signal generally
increases. That is, the resulting signal has more random variations than the original signal.
This is the case with the composite data signal produced by encoder 36; it has more entropy
than the original, digitized voice data.

If, in contrast, the addition of a random signal to (or subtraction from) a non-random
(e.g. voice) signal reduces entropy, then something unusual is happening. It is this anomaly
that can be used to decode the composite data signal.

To fully understand this entropy-based decoding method, it is first helpful to highlight a 
characteristic of the original encoding process: the similar treatment of every Nth (e.g. 480th)
sample. 

In the encoding process discussed above, the auxiliary data is 480 bits long. Since it is
encoded repeatedly, every 480th sample of the composite data signal corresponds to the same
bit of the auxiliary data. If this bit is a "1", the scaled PRN data corresponding thereto are
added to the digitized voice signal; if this bit is a "0", the scaled PRN data corresponding
thereto are subtracted. Due to the repeated encoding of the auxiliary data, every 480th sample
of the composite data signal thus shares a characteristic: they are all either augmented by the
corresponding noise data (which may be negative), or they are all diminished, depending on
whether the bit of the auxiliary data is a "1" or a "0".

To exploit this characteristic, the entropy-based decoding process treats every 480th
sample of the composite signal in like fashion. In particular, the process begins by adding to
the 1st, 481st, 961st, etc. samples of the composite data signal the PRN data with which these
samples were encoded. (That is, a set of sparse PRN data is added: the original PRN set, with
all but every 480th datum zeroed out.) The localized entropy of the resulting signal around
these points (i.e. the composite data signal with every 480th sample modified) is then
computed.

(Computation of a signal's entropy or randomness is well understood by artisans in this
field. One generally accepted technique is to take the derivative of the signal at each sample
point near a point in question (e.g. the modified sample and 4 samples either side), square
these values, and then sum the resulting values over all of the localized regions over the entire
signal. A variety of other well known techniques can alternatively be used.)

The foregoing step is then repeated, this time subtracting the PRN data corresponding
thereto from the 1st, 481st, 961st, etc. composite data samples.

One of these two operations will counteract (e.g. undo) the encoding process and
reduce the resulting signal's entropy; the other will aggravate it. If adding the sparse PRN
data to the composite data reduces its entropy, then this data must earlier have been subtracted
from the original voice signal. This indicates that the corresponding bit of the auxiliary data
signal was a "0" when these samples were encoded. (A "0" at the auxiliary data input of logic
circuit 46 caused it to produce an inverted version of the corresponding PRN datum as its
output datum, resulting in subtraction of the corresponding PRN datum from the voice signal.)

Conversely, if subtracting the sparse PRN data from the composite data reduces its 
entropy, then the encoding process must have earlier added this noise. This indicates that the 
value of the auxiliary data bit was a "1" when samples 1, 481, 961, etc., were encoded.

By noting in which case entropy is lower, (a) adding or (b) subtracting a sparse set of
PRN data to/from the composite data, it can be determined whether the first bit of the auxiliary
data is (a) a "0", or (b) a "1." (In real life applications, in the presence of various distorting
phenomena, the composite signal may be sufficiently corrupted so that neither adding nor
subtracting the sparse PRN data actually reduces entropy. Instead, both operations will
increase entropy. In this case, the "correct" operation can be discerned by observing which
operation increases the entropy less.)
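The add-or-subtract test just described can be tried on synthetic data. The sketch below makes several assumptions that are not in the disclosure: an 8-bit auxiliary word instead of 480 bits, a slowly varying sine wave standing in for voice, a constant scale factor known to the decoder, and Python's random module as the PRN source. "Entropy" is measured as the sum of squared first differences in a window of 4 samples either side of each modified sample, per the technique mentioned above.

```python
import math
import random


def roughness(signal, centers, half_width=4):
    """Sum of squared first differences in a window around each center."""
    total = 0
    for c in centers:
        lo = max(0, c - half_width)
        hi = min(len(signal) - 1, c + half_width)
        total += sum((signal[i + 1] - signal[i]) ** 2 for i in range(lo, hi))
    return total


def encode(voice, aux_bits, prn, gain):
    """Add (bit 1) or subtract (bit 0) the scaled PRN datum at every sample."""
    n = len(aux_bits)
    return [v + gain * (2 * aux_bits[i % n] - 1) * prn[i] for i, v in enumerate(voice)]


def decode_bit(composite, prn, gain, bit_index, period):
    """Try adding and subtracting the sparse PRN set; the smoother result wins."""
    centers = list(range(bit_index, len(composite), period))
    added, subtracted = list(composite), list(composite)
    for c in centers:
        added[c] += gain * prn[c]
        subtracted[c] -= gain * prn[c]
    # Adding undoes an earlier subtraction (bit 0); subtracting undoes an addition (bit 1).
    return 0 if roughness(added, centers) < roughness(subtracted, centers) else 1


def demo():
    period, repeats, gain = 8, 60, 4
    aux_bits = [1, 0, 1, 1, 0, 0, 1, 0]
    n = period * repeats
    voice = [128 + round(40 * math.sin(2 * math.pi * i / 200)) for i in range(n)]
    rng = random.Random(2024)
    prn = [rng.choice((-1, 1)) for _ in range(n)]
    composite = encode(voice, aux_bits, prn, gain)
    decoded = [decode_bit(composite, prn, gain, k, period) for k in range(period)]
    return aux_bits, decoded


if __name__ == "__main__":
    sent, received = demo()
    print(sent == received)
```

Because the comparison is between the two candidate operations rather than against an absolute threshold, the same code also covers the corrupted case in which both operations increase entropy.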

The foregoing operations can then be conducted for the group of spaced samples of the
composite data beginning with the second sample (i.e. 2, 482, 962, ...). The entropy of the
resulting signals indicates whether the second bit of the auxiliary data signal is a "0" or a "1."
Likewise with the following 478 groups of spaced samples in the composite signal, until all
480 bits of the code word have been discerned.

It will be appreciated that the foregoing approach is not sensitive to corruption 
mechanisms that alter the values of individual samples; instead, the process considers the 
entropy of spaced excerpts of the composite data, yielding a high degree of confidence in the
results. 

A second and probably more common decoding technique is based on correlation
between the composite data signal and the PRN data. Such operations are facilitated in the
present context since the auxiliary data whose encoded representation is sought is known, at
least in large part, a priori. (In one form of the invention, the auxiliary data is based on the
authentication data exchanged at call set-up, which the cellular system has already received and
logged; in another form (detailed below), the auxiliary data comprises a predetermined
message.) Thus, the problem can be reduced to determining whether an expected signal is
present or not (rather than looking for an entirely unknown signal). Moreover, data formatter
20 breaks the composite data into frames of known length. (In a known GSM implementation,
voice data is sent in time slots which convey 114 data bits each.) By padding the auxiliary data
as necessary, each repetition of the auxiliary data can be made to start, e.g., at the beginning
of such a frame of data. This, too, simplifies the correlation determinations, since 113 of
every 114 possible bit alignments can be ignored (facilitating decoding even if none of the
auxiliary data is known a priori).

Classically speaking, the detection of the embedded auxiliary data fits nicely into the
old art of detecting known signals in noise. Noise in this last statement can be interpreted very
broadly, even to the point where the subscriber's voice can be considered noise, relative to the
need to detect the underlying auxiliary data. One of many references to this older art is the
book Kassam, Saleem A., "Signal Detection in Non-Gaussian Noise," Springer-Verlag, 1988
(available at the Library of Congress by catalog number TK5102.5 .K357 1988).

In particular, section 1.2 "Basic Concepts of Hypothesis Testing" of Kassam's book
lays out the basic concept of a binary hypothesis, assigning the value "1" to one hypothesis and
the value "0" to the other hypothesis. The last paragraph of that section is also on point
regarding the initial preferred embodiment of this invention, i.e., that the "0" hypothesis
corresponds to the "noise only" case, whereas the "1" corresponds to the presence of a signal in
the observations. In the current preferred embodiment, the "noise-only" case is effectively
ignored: an identification process will either come up with our N-bit identification
word or it will come up with "garbage."

The continued and inevitable engineering improvement in the detection of embedded 
code signals will undoubtedly borrow heavily from this generic field of known signal 


- 14- 


WYC:dks 60085 01/05/00 


EXPRESS MAIL EL525676538US 


detection. A common and well-known technique in this field is the so-called "matched filter," 
which is incidentally discussed early in section 2 of the Kassam book. Many basic texts on 
signal processing include discussions on this method of signal detection. This is also known in 
some fields as correlation detection. Where, as here, the location of the auxiliary signal is 
known a priori (or more accurately, known to fall within one of a few discrete locations, as
discussed above), then the matched filter can often be reduced to a simple vector dot product
between a set of sparse PRN data, and mean-removed excerpts of the composite signal
corresponding thereto. (Note that the PRN data need not be sparse and may arrive in
contiguous bursts, such as in British patent publication 2,196,167 mentioned earlier wherein a
given bit in a message has contiguous PRN values associated with it.) Such a process steps
through all 480 sparse sets of PRN data and performs corresponding dot product operations. If
the dot product is positive, the corresponding bit of the auxiliary data signal is a "1;" if the dot
product is negative, the corresponding bit of the auxiliary data signal is a "0." If several
alignments of the auxiliary data signal within the framed composite signal are possible, this
procedure is repeated at each candidate alignment, and the one yielding the highest correlation
is taken as true. (Once the correct alignment is determined for a single bit of the auxiliary data
signal, the alignment of all the other bits can be determined therefrom. "Alignment," perhaps
better known as "synchronization," can be achieved primarily through the very same
mechanisms which lock on and track the voice signal itself and allow for the basic functioning
of the cellular unit.)
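
By way of illustration only, the dot product detection just described might be sketched as follows. The function names, the representation of each sparse PRN set as a list of (sample index, PRN value) pairs, the use of the summed absolute correlation as the alignment score, and the toy `embed` encoder are assumptions made for this sketch and are not taken from the disclosure. The PRN values in each set are assumed to be balanced (as many +1 as -1 values), since mean removal would otherwise cancel part of the embedded signal.

```python
def embed(host, prn_sets, bits, gain, offset=0):
    """Toy encoder (assumption): add +gain*prn for a 1 bit, -gain*prn for a 0 bit."""
    signal = list(host)
    for bit, prn_set in zip(bits, prn_sets):
        sign = 1 if bit else -1
        for index, prn in prn_set:
            signal[offset + index] += sign * gain * prn
    return signal


def correlate(signal, prn_set, offset=0):
    """Dot product of one sparse PRN set with the mean-removed samples it addresses."""
    samples = [signal[offset + index] for index, _ in prn_set]
    mean = sum(samples) / len(samples)
    return sum((s - mean) * prn for s, (_, prn) in zip(samples, prn_set))


def decode_bits(signal, prn_sets, offset=0):
    """One bit per PRN set: positive dot product gives 1, otherwise 0."""
    return [1 if correlate(signal, ps, offset) > 0 else 0 for ps in prn_sets]


def best_alignment(signal, prn_sets, candidate_offsets):
    """Return the candidate offset whose total absolute correlation is highest."""
    def score(offset):
        return sum(abs(correlate(signal, ps, offset)) for ps in prn_sets)
    return max(candidate_offsets, key=score)
```

In use, a decoder of this kind would first call `best_alignment` over the few discrete candidate locations and then call `decode_bits` at the winning offset.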

One principle which did not seem to be explicitly present in the Kassam book and 
which was developed rudimentarily by the inventor involves the exploitation of the magnitudes 
of the statistical properties of the auxiliary data signal being sought relative to the magnitude of 
the statistical properties of the composite signal as a whole. In particular, the problematic case 
seems to be where the auxiliary data signals we are looking for are of much lower level than
the noise and corruption present on a difference signal between the composite and digitized
voice signals. Fig. 4 attempts to set the stage for the reasoning behind this approach. Fig. 4A
contains a generic look at the differences in the histograms between a typical "problematic"
difference signal, i.e., a difference signal which has a much higher overall energy than the
auxiliary data that may or may not be within it. The term "mean-removed" simply means that 
the means of both the difference signal and the auxiliary data have been removed, a common 
operation prior to performing a normalized dot product. Fig. 4B then has a generally similar 
histogram plot of the derivatives of the two signals. From pure inspection it can be seen that a 
simple thresholding operation in the derivative transform domain, with a subsequent 
conversion back into the signal domain, will go a long way toward removing certain innate 
biases on the dot product "recognition algorithm" of a few paragraphs back. Thresholding 
here refers to the idea that if the absolute value of a difference signal derivative value exceeds 
some threshold, then it is replaced simply by that threshold value. The threshold value can be 
so chosen to contain most of the histogram of the embedded signal. 
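
The thresholding operation described above might be sketched as follows. This is a minimal illustration under stated assumptions: the derivative is taken as the first difference between adjacent samples, the sign of a clipped derivative value is preserved, and the conversion back into the signal domain is a running sum starting from the first sample. The name `clip_derivative` is hypothetical.

```python
def clip_derivative(signal, threshold):
    """Clip first differences to +/-threshold, then integrate back.

    Assumptions: first difference as the derivative, sign preserved on
    clipping, running sum from the first sample as the inverse transform.
    """
    if not signal:
        return []
    out = [signal[0]]
    for previous, current in zip(signal, signal[1:]):
        derivative = current - previous
        derivative = max(-threshold, min(threshold, derivative))
        out.append(out[-1] + derivative)
    return out
```

For example, with a threshold of 2 the large jumps in `[0, 1, 10, 11, 3]` are limited while the small ones pass unchanged, so that high-energy excursions contribute less to the subsequent dot product.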

Another operation which can be of minor assistance in "alleviating" some of the bias 
effects in the dot product algorithm is the removal of the low order frequencies by, e.g., high 
pass filtering with a cutoff near the origin. 
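
One simple way to remove low order frequencies is a first-order recursive high-pass filter. The particular filter, the name `high_pass`, and the coefficient `alpha` are assumptions chosen for illustration, the disclosure not specifying a filter design. Values of `alpha` nearer 1 place the cutoff nearer the origin.

```python
def high_pass(samples, alpha):
    """First-order high-pass: y[n] = alpha * (y[n-1] + x[n] - x[n-1]).

    The output starts at zero (an assumption), so a constant input
    yields an all-zero output.
    """
    if not samples:
        return []
    out = [0.0]
    for previous, current in zip(samples, samples[1:]):
        out.append(alpha * (out[-1] + current - previous))
    return out
```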

Security Considerations 

Security of the present invention depends, in large part, on security of the PRN data 
and/or security of the auxiliary data. In the following discussion, a few of many possible 
techniques for assuring the security of these data are discussed. 

In a first embodiment, each telephone 10 is provided with a long noise key unique to 
the telephone. This key may be, e.g., a highly unique 10,000 bit string stored in ROM. (In 
most applications, keys substantially shorter than this may be used.) 

The central office 14 has access to a secure disk 52 on which such key data for all 
authorized telephones are stored. (The disk may be remote from the office itself.) 

Each time the telephone is used, fifty bits from this noise key are identified and used as 
the seed for a deterministic pseudo random number generator. The data generated by this PRN 
generator serve as the PRN data for that telephone call.

The fifty bit seed can be determined, e.g., by using a random number generator in the
telephone to generate an offset address between 0 and 9,950 each time the telephone is used to 
place a call. The fifty bits in the noise key beginning at this offset address are used as the 
seed. 
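
The seed selection just described might be sketched as follows. The names are hypothetical, the noise key is assumed to be held as a list of 0/1 values, and Python's `random.Random` serves only as a stand-in for whatever deterministic PRN generator an actual telephone would use (it is not cryptographically strong).

```python
import random

KEY_BITS = 10_000
SEED_BITS = 50


def pick_offset(rng):
    """Random offset address between 0 and 9,950 inclusive."""
    return rng.randint(0, KEY_BITS - SEED_BITS)


def seed_from_key(noise_key, offset):
    """The 50 key bits beginning at `offset`, packed into one integer."""
    bits = noise_key[offset:offset + SEED_BITS]
    return int("".join(str(b) for b in bits), 2)


def prn_data(seed, length):
    """Deterministic sequence of +1/-1 PRN values derived from the seed."""
    generator = random.Random(seed)
    return [generator.choice((-1, 1)) for _ in range(length)]
```

Any party holding the same noise key and the same offset address derives the same seed, and hence the same PRN data. A telephone might call `pick_offset(random.SystemRandom())` to draw a fresh offset for each call.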


During call setup, this offset address is transmitted by the telephone, through the cell
site 12, to the central office 14. There, a computer at the central office uses the offset address
to index its copy of the noise key for that telephone. The central office thereby identifies the
same 50 bit seed as was identified at the telephone. The central office 14 then relays these 50
bits to the cell site 12, where a deterministic noise generator like that in the telephone
generates a PRN sequence corresponding to the 50 bit key and applies same to its decoder 38.

By the foregoing process, the same sequence of PRN data is generated both at the
telephone and at the cell site. Accordingly, the auxiliary data encoded on the voice data by the
telephone can be securely transmitted to, and accurately decoded by, the cell site. If this
auxiliary data does not match the expected auxiliary data (e.g. data transmitted at call set-up),
the call is flagged as fraudulent and appropriate remedial action is taken.

It will be recognized that an eavesdropper listening to radio transmission of call set-up
information can intercept only the randomly generated offset address transmitted by the
telephone to the cell site. This data, alone, is useless in pirating calls. Even if the hacker had
access to the signals provided from the central office to the cell site, this data too is essentially
useless: all that is provided is a 50 bit seed. Since this seed is different for nearly each call
(repeating only 1 out of every 9,950 calls), it too is unavailing to the hacker.

In a related system, the entire 10,000 bit noise key can be used as a seed. An offset
address randomly generated by the telephone during call set-up can be used to identify where,
in the PRN data resulting from that seed, the PRN data to be used for that session is to begin.
(Assuming 4800 voice samples per second, 4800 PRN data are required per second, or about
17 million PRN data per hour. Accordingly, the offset address in this variant embodiment will
likely be far larger than the offset address described above.)


In this variant embodiment, the PRN data used for decoding is preferably generated at 
the central station from the 10,000 bit seed, and relayed to the cell site. (For security reasons, 
the 10,000 bit noise key should not leave the security of the central office.) 
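
The variant in which the whole noise key seeds the generator, and an offset selects where in the resulting PRN stream a session begins, might be sketched as follows. The names are hypothetical and `random.Random` is again only a stand-in generator. The helper `prn_per_hour` reproduces the arithmetic behind the "about 17 million" figure (4800 x 3600 = 17,280,000).

```python
import random
from itertools import count, islice

SAMPLES_PER_SECOND = 4800


def prn_per_hour(rate=SAMPLES_PER_SECOND):
    """PRN values needed per hour at one value per voice sample."""
    return rate * 3600


def session_prn(noise_key_bits, offset, length):
    """PRN values for one session: seed with the entire key, skip `offset`
    values of the resulting stream, then take `length` values."""
    seed = int("".join(str(b) for b in noise_key_bits), 2)
    generator = random.Random(seed)
    stream = (generator.choice((-1, 1)) for _ in count())
    return list(islice(stream, offset, offset + length))
```

This sketch skips to the offset by generating and discarding values, which is adequate for illustration; a practical system would presumably prefer a generator that can jump directly to a given position.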

In variants of the foregoing systems, the offset address can be generated by the central 
station or at the cell site, and relayed to the telephone during call set-up, rather than vice versa. 

In another embodiment, the telephone 10 may be provided with a list of one-time seeds, 
matching a list of seeds stored on the secure disk 52 at the central office. Each time the 
telephone is used to originate a new call, the next seed in the list is used. By this arrangement, 
no data needs to be exchanged relating to the seed; the telephone and the carrier each 
independently know which seed to use to generate the pseudo random data sequence for the 
current session. 

In such an embodiment, the carrier can determine when the telephone has nearly 
exhausted its list of seeds, and can transmit a substitute list (e.g. as part of administrative data 
occasionally provided to the telephone). To enhance security, the carrier may require that the 
telephone be returned for manual reprogramming, to avoid radio transmission of this sensitive 
information. Alternatively, the substitute seed list can be encrypted for radio transmission 
using any of a variety of well known techniques. 
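
The one-time seed list might be modeled as follows. The class name and methods are hypothetical; the sketch assumes only that the telephone and the carrier hold identical lists and each advance by one seed per originated call.

```python
class SeedList:
    """A list of one-time seeds consumed in order, one per call."""

    def __init__(self, seeds):
        self._seeds = list(seeds)
        self._next = 0

    def next_seed(self):
        """Return the seed for the next call and advance the list."""
        if self._next >= len(self._seeds):
            raise IndexError("seed list exhausted")
        seed = self._seeds[self._next]
        self._next += 1
        return seed

    def remaining(self):
        """Number of unused seeds; a low count signals that a substitute
        list is needed."""
        return len(self._seeds) - self._next
```

Because both parties step through identical lists, no seed-related data need be exchanged at call set-up. The carrier could compare `remaining()` against a limit of its choosing to decide when to issue a substitute list.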

In a second class of embodiments, security derives not from the security of the PRN 
data, but from security of the auxiliary message data encoded thereby. One such system relies 
on transmission of a randomly selected one of 256 possible messages. 

In this embodiment, a ROM in the telephone stores 256 different messages (each 
message may be, e.g., 128 bits in length). When the telephone is operated to initiate a call, the 
telephone randomly generates a number between 1 and 256, which serves as an index to these 
stored messages. This index is transmitted to the cell site during call set-up, allowing the 
central station to identify the expected message from a matching database on secure disk 52 
containing the same 256 messages. (Each telephone has a different collection of messages.) 
(Alternatively, the carrier may randomly select the index number during call set-up and 
transmit it to the telephone, identifying the message to be used during that session.) In a
theoretically pure world where proposed attacks to a secure system are only mathematical in
nature, many of these additional layers of security might seem superfluous. (The addition of
these extra layers of security, such as differing the messages themselves, simply acknowledges
that the designer of actual public-functioning secure systems will face certain implementation
economics which might compromise the mathematical security of the core principles of this
invention, and thus these auxiliary layers of security may afford new tools against the
inevitable attacks on implementation.)

Thereafter, all voice data transmitted by the telephone for the duration of that call is 
steganographically encoded with the indexed message. The cell site checks the data received 
from the telephone for the presence of the expected message. If the message is absent, or if a
different message is decoded instead, the call is flagged as fraudulent and remedial action is
taken.
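
The indexed-message check might be sketched as follows. The function names are hypothetical, messages are assumed to be held as 16-byte (128 bit) values in a list, and a decoder that finds no message is assumed to report `None`.

```python
import secrets

NUM_MESSAGES = 256


def choose_index():
    """Randomly pick a message index between 1 and 256, as at call set-up."""
    return secrets.randbelow(NUM_MESSAGES) + 1


def is_fraudulent(stored_messages, index, decoded_message):
    """True if the decoded message is absent (None) or differs from the
    message stored for this telephone under the transmitted index."""
    expected = stored_messages[index - 1]
    return decoded_message is None or decoded_message != expected
```

The same check can be repeated on later samples of the call, since the indexed message is encoded throughout its duration.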

In this second embodiment, the PRN data used for encoding and decoding can be as
simple or complex as desired. A simple system may use the same PRN data for each call.
Such data may be generated, e.g., by a deterministic PRN generator seeded with fixed data
unique to the telephone and known also by the central station (e.g. a telephone identifier), or a
universal noise sequence can be used (i.e. the same noise sequence can be used for all
telephones). Or the pseudo random data can be generated by a deterministic PRN generator
seeded with data that changes from call to call (e.g. based on data transmitted during call set-up
identifying, e.g., the destination telephone number, etc.). Some embodiments may seed the
pseudo random number generator with data from a preceding call (since this data is necessarily
known to the telephone and the carrier, but is likely not known to pirates).

Naturally, elements from the foregoing two approaches can be combined in various 
ways, and supplemented by other features. The foregoing embodiments are exemplary only, 
and do not begin to catalog the myriad approaches which may be used. Generally speaking,
any data which is necessarily known or knowable by both the telephone and the cell site/central
station, can be used as the basis for either the auxiliary message data, or the PRN data by
which it is encoded.

Since the preferred embodiments of the present invention each redundantly encode the
auxiliary data throughout the duration of the subscriber's digitized voice, the auxiliary data can 
be decoded from any brief sample of received audio. In the preferred forms of the invention, 
the carrier repeatedly checks the steganographically encoded auxiliary data (e.g. every 10 
seconds, or at random intervals) to assure that it continues to have the expected attributes. 

While the foregoing discussion has focused on steganographically encoding a 
transmission from a cellular telephone, it will be recognized that transmissions to a cellular 
telephone can be steganographically encoded as well. Such arrangements find applicability, 
e.g., in conveying administrative data (i.e. non-voice data) from the carrier to individual 
telephones. This administrative data can be used, for example, to reprogram parameters of 
targeted cellular telephones (or all cellular telephones) from a central location, to update seed 
lists (for systems employing the above-described one-time pad system), to apprise "roaming"
cellular telephones of data unique to an unfamiliar local area, etc. 

In some embodiments, the carrier may steganographically transmit to the cellular 
telephone a seed which the cellular phone is to use in its transmissions to the carrier during the 
remainder of that session. 

While the foregoing discussion has focused on steganographic encoding of the baseband 
digitized voice data, artisans will recognize that intermediate frequency signals (whether analog 
or digital) can likewise be steganographically encoded in accordance with principles of the 
invention. An advantage of post-baseband encoding is that the bandwidth of these intermediate 
signals is relatively large compared with the baseband signal, allowing more auxiliary data to 
be encoded therein, or allowing a fixed amount of auxiliary data to be repeated more 
frequently during transmission. (If steganographic encoding of an intermediate signal is 
employed, care should be taken that the perturbations introduced by the encoding are not so 
large as to interfere with reliable transmission of the administrative data, taking into account
any error correcting facilities supported by the packet format.)

Those skilled in the art will recognize that the auxiliary data, itself, can be arranged in 
known ways to support error detecting, or error correcting capabilities by the decoder 38. The
interested reader is referred, e.g., to Rorabaugh, Error Coding Cookbook, McGraw Hill,
1996, one of many readily available texts detailing such techniques.
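
Two familiar arrangements of this kind are sketched below: a CRC-32 checksum appended to the auxiliary data for error detection, and a bitwise repetition code with majority voting for error correction. Both are generic textbook techniques chosen for illustration, not techniques named in the disclosure, and the function names are hypothetical.

```python
import zlib


def add_crc(payload: bytes) -> bytes:
    """Append a 4-byte CRC-32 so the decoder can detect corruption."""
    return payload + zlib.crc32(payload).to_bytes(4, "big")


def check_crc(frame: bytes):
    """Return the payload if its CRC-32 matches, otherwise None."""
    payload, received = frame[:-4], frame[-4:]
    if zlib.crc32(payload).to_bytes(4, "big") == received:
        return payload
    return None


def repeat_bits(bits, n=3):
    """Repetition code: send each bit n times."""
    return [b for b in bits for _ in range(n)]


def majority_decode(coded, n=3):
    """Recover each bit by majority vote over its n copies."""
    return [1 if 2 * sum(coded[i:i + n]) > n else 0
            for i in range(0, len(coded), n)]
```

With `n=3`, the repetition code corrects any single error within each group of three copies, at the cost of tripling the number of auxiliary bits to be encoded.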

While the preferred embodiment is illustrated in the context of a cellular system 
utilizing packetized data, other wireless systems do not employ such conveniently framed data. 
In systems in which framing is not available as an aid to synchronization, synchronization 
marking can be achieved within the composite data signal by techniques such as that detailed in 
applicant's prior applications. In one class of such techniques, the auxiliary data itself has 
characteristics facilitating its synchronization. In another class of techniques, the auxiliary data 
modulates one or more embedded carrier patterns which are designed to facilitate alignment 
and detection. 

As noted earlier, the principles of the invention are not restricted to use with the 
particular forms of steganographic encoding detailed above. Indeed, any steganographic 
encoding technique previously known, or hereafter invented, can be used in the fashion 
detailed above to enhance the security or functionality of cellular (or other wireless, e.g. PCS) 
communications systems. Likewise, these principles are not restricted to wireless telephones; 
any wireless transmission may be provided with an "in-band" channel of this type. 

It will be recognized that systems for implementing applicant's invention can comprise
dedicated hardware circuit elements, but more commonly comprise suitably programmed
microprocessors with associated RAM and ROM memory (e.g. one such system in each of the
telephone 10, cell-site 12, and central office 14).

Errata

Applicant is preparing a steganographic marking/decoding "plug-in" for use with
Adobe Photoshop software. The latest version of this software, presented as commented
source code, is attached as Appendix B. The code was written for compilation with
Microsoft's Visual C++ compiler, version 4.0, and can be understood by those skilled in the
art.

This source code embodies several improvements to the technology disclosed in
applicant's prior applications, both in encoding and decoding, and also in user interface.

Applicant's copyrights in the Exhibit B code are reserved, save for permission to 
reproduce same as part of the specification of the patent. 

While the Exhibit B software is particularly designed for the steganographic encoding 
and decoding of auxiliary data in/from two-dimensional image data, many principles thereof 
are applicable to the encoding of digitized audio, as contemplated by the presently claimed 
invention. 

Before concluding, it may be instructive to review some of the other fields where 
principles of applicant's technology (both in this application, and prior applications) can be 
employed. 

One is document security for passports, visas, "green cards," etc. The photos on such 
documents can be processed to embed a subliminal data signal therein, serving to authenticate 
the document. 

Related to the foregoing are objects (e.g. photos and ID cards) having biometric data 
embedded therein. One example of such biometric data is a fingerprint, allowing the 
authenticity of a person bearing such an ID to be checked. 

Another application is smart business cards, wherein a business card is provided with a 
photograph having unobtrusive, machine-readable contact data embedded therein. (The same 
function can be achieved by changing the surface microtopology of the card to embed the data 
therein.) 

Yet another promising application is in content regulation. Television signals, images 
on the internet, and other content sources (audio, image, video, etc.) can have data indicating 
their "appropriateness" (i.e. their rating for sex, violence, suitability for children, etc.) actually 
embedded in the content itself rather than externally associated therewith. Television 
receivers, web browsers, etc., can discern such appropriateness ratings (e.g. by use of 
universal code decoding) and can take appropriate action (e.g. not permitting viewing of an 
image or video, or play-back of an audio source).

Credit cards are also likely candidates for enhancement by use of steganographic
marking, providing an invisible and covert data carrier to extend functionality and improve 
security. 

The field of merchandise marking is generally well served by familiar bar codes and 
universal product codes. However, in certain applications, such bar codes are undesirable 
(e.g. for aesthetic considerations, or where security is a concern). In such applications, 
applicant's technology may be used to mark merchandise, either through an innocuous carrier
(e.g. a photograph associated with the product), or by encoding the microtopology of the 
merchandise's surface, or a label thereon. 

There are applications — too numerous to detail — in which steganography can 
advantageously be combined with encryption and/or digital signature technology to provide 
enhanced security. 

Medical records appear to be an area in which authentication is important. 
Steganographic principles — applied either to film-based records or to the microtopology of 
documents — can be employed to provide some protection against tampering. 

Many industries, e.g. automobile and airline, rely on tags to mark critical parts. Such 
tags, however, are easily removed, and can often be counterfeited. In applications wherein 
better security is desired, industrial parts can be steganographically marked to provide an 
inconspicuous identification/authentication tag. 

In various of the applications reviewed above and in applicant's earlier applications, 
different messages can be steganographically conveyed by different regions of an image (e.g. 
different regions of an image can provide different internet URLs, or different regions of a 
photocollage can identify different photographers). Likewise with other media (e.g. sound). 

Some software visionaries look to the day when data blobs will roam the datawaves and
interact with other data blobs. In such an era, it will be necessary that such blobs have robust and
incorruptible ways to identify themselves. Steganographic techniques again hold much
promise here.

Finally, message changing codes - recursive systems in which steganographically
encoded messages actually change underlying steganographic code patterns - offer new levels
of sophistication and security. Such message changing codes are particularly well suited to 
applications such as plastic cash cards where time-changing elements are important to enhance 
security. 

Again, while applicant prefers the particular forms of steganographic encoding, the 
foregoing applications (and applications disclosed in applicant's prior applications) can be 
practiced with other steganographic marking techniques. 

Having described and illustrated the principles of my invention with reference to 
various embodiments thereof, it should be apparent that the invention can be modified in 
arrangement and detail without departing from such principles. Moreover, a variety of 
enhancements can be incorporated from the teachings of my prior applications. 

Accordingly, I claim as my invention all such embodiments as come within the scope 
and spirit of the following claims and equivalents thereto.